Overview

Dataset statistics

Number of variables: 37
Number of observations: 23436
Missing cells: 352
Missing cells (%): < 0.1%
Duplicate rows: 15
Duplicate rows (%): 0.1%
Total size in memory: 28.9 MiB
Average record size in memory: 1.3 KiB
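
The percentages and the average record size above follow from the raw counts. A minimal sketch of that arithmetic, assuming "missing cells (%)" is taken over all cells (variables × observations) and that 1 MiB = 1024 KiB; the variable names are illustrative only:

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 37
n_observations = 23436
missing_cells = 352
duplicate_rows = 15
total_size_mib = 28.9

# Assumption: the missing-cell percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations
avg_record_kib = total_size_mib * 1024 / n_observations

print(f"total cells:        {total_cells}")
print(f"missing cells (%):  {missing_pct:.3f}")
print(f"duplicate rows (%): {duplicate_pct:.3f}")
print(f"avg record (KiB):   {avg_record_kib:.2f}")
```

Rounded to one decimal place, these values agree with the figures reported in the table.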

Variable types

CAT: 19
NUM: 15
UNSUPPORTED: 3

Reproduction

Analysis started: 2020-10-05 02:29:54.575902
Analysis finished: 2020-10-05 02:30:37.868888
Duration: 43.29 seconds
Version: pandas-profiling v2.8.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

Dataset has 15 (0.1%) duplicate rows Duplicates
EmployeeNumber has a high cardinality: 23366 distinct values High cardinality
HourlyRate has a high cardinality: 73 distinct values High cardinality
MonthlyIncome has a high cardinality: 1351 distinct values High cardinality
StandardHours is highly correlated with EnvironmentSatisfaction High correlation
EnvironmentSatisfaction is highly correlated with StandardHours High correlation
Over18 is highly correlated with Gender and 9 other fields High correlation
Gender is highly correlated with Over18 and 1 other fields High correlation
HourlyRate is highly correlated with Over18 and 1 other fields High correlation
JobRole is highly correlated with Over18 and 1 other fields High correlation
JobSatisfaction is highly correlated with Over18 High correlation
MaritalStatus is highly correlated with Over18 High correlation
OverTime is highly correlated with Over18 High correlation
PercentSalaryHike is highly correlated with Over18 and 1 other fields High correlation
PerformanceRating is highly correlated with Over18 and 1 other fields High correlation
StandardHours is highly correlated with Gender and 6 other fields High correlation
Employee Source is highly correlated with Over18 and 1 other fields High correlation
EnvironmentSatisfaction is highly skewed (γ1 = 108.2353204) Skewed
NumCompaniesWorked is highly skewed (γ1 = 144.5986932) Skewed
StockOptionLevel is highly skewed (γ1 = 30.4052657) Skewed
EmployeeNumber is uniformly distributed Uniform
DistanceFromHome is an unsupported type, check if it needs cleaning or further analysis Unsupported
EmployeeCount is an unsupported type, check if it needs cleaning or further analysis Unsupported
Application ID is an unsupported type, check if it needs cleaning or further analysis Unsupported
NumCompaniesWorked has 3176 (13.6%) zeros Zeros
StockOptionLevel has 10066 (43.0%) zeros Zeros
TrainingTimesLastYear has 871 (3.7%) zeros Zeros
YearsAtCompany has 740 (3.2%) zeros Zeros
YearsInCurrentRole has 3925 (16.7%) zeros Zeros
YearsSinceLastPromotion has 9271 (39.6%) zeros Zeros
YearsWithCurrManager has 4197 (17.9%) zeros Zeros

Variables

Age
Real number (ℝ≥0)

Distinct count: 43
Unique (%): 0.2%
Missing: 3
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 36.93667050740409
Minimum: 18.0
Maximum: 60.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 183.2 KiB

Quantile statistics

Minimum: 18
5-th percentile: 24
Q1: 30
median: 36
Q3: 43
95-th percentile: 54
Maximum: 60
Range: 42
Interquartile range (IQR): 13

Descriptive statistics

Standard deviation: 9.137431829
Coefficient of variation (CV): 0.2473810363
Kurtosis: -0.40829426
Mean: 36.93667051
Median Absolute Deviation (MAD): 6
Skewness: 0.4102218486
Sum: 865537
Variance: 83.49266042
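
Several of the Age statistics are derived from others in the same tables. The sketch below recomputes them from the reported values as a consistency check; it assumes the usual definitions (range = max − min, IQR = Q3 − Q1, CV = standard deviation / mean, variance = standard deviation squared) and that the mean is taken over non-missing observations only:

```python
# Values copied from the Age statistics.
minimum, maximum = 18, 60
q1, q3 = 30, 43
mean = 36.93667051
std = 9.137431829
total = 865537
n_observations, n_missing = 23436, 3

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance as squared standard deviation
# Assumption: missing values are excluded before averaging.
mean_from_sum = total / (n_observations - n_missing)

print(value_range, iqr, round(cv, 6), round(variance, 4), round(mean_from_sum, 6))
```

Each recomputed value agrees with the corresponding reported figure to the precision shown in the report.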
Histogram with fixed size bins (bins=10)
Value              Count  Frequency (%)
34                 1230   5.2%
35                 1227   5.2%
36                 1106   4.7%
31                 1085   4.6%
29                 1075   4.6%
32                 984    4.2%
30                 952    4.1%
38                 945    4.0%
33                 927    4.0%
40                 893    3.8%
37                 806    3.4%
27                 773    3.3%
28                 753    3.2%
42                 737    3.1%
39                 661    2.8%
45                 650    2.8%
41                 648    2.8%
26                 612    2.6%
44                 528    2.3%
46                 520    2.2%
43                 512    2.2%
50                 481    2.1%
25                 416    1.8%
24                 416    1.8%
49                 385    1.6%
Other values (18)  4111   17.5%

Value  Count  Frequency (%)
18     127    0.5%
19     143    0.6%
20     175    0.7%
21     213    0.9%
22     257    1.1%
23     223    1.0%
24     416    1.8%
25     416    1.8%
26     612    2.6%
27     773    3.3%

Value  Count  Frequency (%)
60     80     0.3%
59     161    0.7%
58     224    1.0%
57     63     0.3%
56     223    1.0%
55     349    1.5%
54     287    1.2%
53     309    1.3%
52     288    1.2%
51     302    1.3%

Attrition
Categorical

Distinct count: 2
Unique (%): < 0.1%
Missing: 13
Missing (%): 0.1%
Memory size: 183.2 KiB

Value                  Count  Frequency (%)
Current employee       19714  84.1%
Voluntary Resignation  3709   15.8%
(Missing)              13     0.1%

Length

Max length: 21
Median length: 16
Mean length: 16.78409285
Min length: 3

Overview of Unicode Properties

Unique unicode characters: 18
Unique unicode categories: 3
Unique unicode scripts: 2
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
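
As an illustration of how such character-property counts can be derived, the sketch below tallies Unicode general categories for the Attrition values using Python's standard `unicodedata` module. It is not the profiler's own code. It assumes the 13 missing entries were converted to the string "nan" before the text statistics were computed; this assumption is consistent with the reported minimum length of 3 and with the category totals in this section.

```python
import unicodedata
from collections import Counter

# Value counts copied from the Attrition frequency table.
# ASSUMPTION: the 13 missing entries appear as the literal string "nan".
value_counts = {
    "Current employee": 19714,
    "Voluntary Resignation": 3709,
    "nan": 13,
}

# Weight each character's general category by how often its value occurs.
category_counts = Counter()
for text, count in value_counts.items():
    for char in text:
        category_counts[unicodedata.category(char)] += count

total_chars = sum(category_counts.values())
mean_length = total_chars / sum(value_counts.values())

# "Ll" = Lowercase Letter, "Lu" = Uppercase Letter, "Zs" = Space Separator
print(dict(category_counts), total_chars, round(mean_length, 6))
```

Under that assumption the tallies match the report: 342797 lowercase letters, 27132 uppercase letters, 23423 space separators, and a mean length of about 16.784.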

Most occurring characters

ValueCountFrequency (%) 
e8256521.0%
 
r4313711.0%
 
n308677.8%
 
o271326.9%
 
t271326.9%
 
l234236.0%
 
u234236.0%
 
y234236.0%
 
234236.0%
 
C197145.0%
 
m197145.0%
 
p197145.0%
 
a74311.9%
 
i74181.9%
 
V37090.9%
 
R37090.9%
 
s37090.9%
 
g37090.9%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter34279787.1%
 
Uppercase Letter271326.9%
 
Space Separator234236.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
C1971472.7%
 
V370913.7%
 
R370913.7%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e8256524.1%
 
r4313712.6%
 
n308679.0%
 
o271327.9%
 
t271327.9%
 
l234236.8%
 
u234236.8%
 
y234236.8%
 
m197145.8%
 
p197145.8%
 
a74312.2%
 
i74182.2%
 
s37091.1%
 
g37091.1%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
23423100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin36992994.0%
 
Common234236.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e8256522.3%
 
r4313711.7%
 
n308678.3%
 
o271327.3%
 
t271327.3%
 
l234236.3%
 
u234236.3%
 
y234236.3%
 
C197145.3%
 
m197145.3%
 
p197145.3%
 
a74312.0%
 
i74182.0%
 
V37091.0%
 
R37091.0%
 
s37091.0%
 
g37091.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
23423100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII393352100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e8256521.0%
 
r4313711.0%
 
n308677.8%
 
o271326.9%
 
t271326.9%
 
l234236.0%
 
u234236.0%
 
y234236.0%
 
234236.0%
 
C197145.0%
 
m197145.0%
 
p197145.0%
 
a74311.9%
 
i74181.9%
 
V37090.9%
 
R37090.9%
 
s37090.9%
 
g37090.9%
 

BusinessTravel
Categorical

Distinct count: 3
Unique (%): < 0.1%
Missing: 8
Missing (%): < 0.1%
Memory size: 183.2 KiB

Value              Count  Frequency (%)
Travel_Rarely      16620  70.9%
Travel_Frequently  4413   18.8%
Non-Travel         2395   10.2%
(Missing)          8      < 0.1%

Length

Max length: 17
Median length: 13
Mean length: 13.44320703
Min length: 3

Overview of Unicode Properties

Unique unicode characters: 17
Unique unicode categories: 4
Unique unicode scripts: 2
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e4887415.5%
 
r4446114.1%
 
l4446114.1%
 
a4005612.7%
 
T234287.4%
 
v234287.4%
 
_210336.7%
 
y210336.7%
 
R166205.3%
 
n68242.2%
 
F44131.4%
 
q44131.4%
 
u44131.4%
 
t44131.4%
 
N23950.8%
 
o23950.8%
 
-23950.8%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter24477177.7%
 
Uppercase Letter4685614.9%
 
Connector Punctuation210336.7%
 
Dash Punctuation23950.8%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
T2342850.0%
 
R1662035.5%
 
F44139.4%
 
N23955.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e4887420.0%
 
r4446118.2%
 
l4446118.2%
 
a4005616.4%
 
v234289.6%
 
y210338.6%
 
n68242.8%
 
q44131.8%
 
u44131.8%
 
t44131.8%
 
o23951.0%
 

Most frequent Connector Punctuation characters

ValueCountFrequency (%) 
_21033100.0%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-2395100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin29162792.6%
 
Common234287.4%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e4887416.8%
 
r4446115.2%
 
l4446115.2%
 
a4005613.7%
 
T234288.0%
 
v234288.0%
 
y210337.2%
 
R166205.7%
 
n68242.3%
 
F44131.5%
 
q44131.5%
 
u44131.5%
 
t44131.5%
 
N23950.8%
 
o23950.8%
 

Most frequent Common characters

ValueCountFrequency (%) 
_2103389.8%
 
-239510.2%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII315055100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e4887415.5%
 
r4446114.1%
 
l4446114.1%
 
a4005612.7%
 
T234287.4%
 
v234287.4%
 
_210336.7%
 
y210336.7%
 
R166205.3%
 
n68242.2%
 
F44131.4%
 
q44131.4%
 
u44131.4%
 
t44131.4%
 
N23950.8%
 
o23950.8%
 
-23950.8%
 

DailyRate
Real number (ℝ≥0)

Distinct count: 883
Unique (%): 3.8%
Missing: 12
Missing (%): 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 801.8287653688525
Minimum: 102.0
Maximum: 1499.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 183.2 KiB

Quantile statistics

Minimum: 102
5-th percentile: 164
Q1: 465
median: 802
Q3: 1157
95-th percentile: 1423
Maximum: 1499
Range: 1397
Interquartile range (IQR): 692

Descriptive statistics

Standard deviation: 403.2061664
Coefficient of variation (CV): 0.502858196
Kurtosis: -1.204785263
Mean: 801.8287654
Median Absolute Deviation (MAD): 344
Skewness: -0.004644039599
Sum: 18782037
Variance: 162575.2127
Histogram with fixed size bins (bins=10)
Value               Count  Frequency (%)
691                 99     0.4%
408                 81     0.3%
1329                80     0.3%
329                 79     0.3%
1082                79     0.3%
530                 79     0.3%
427                 68     0.3%
334                 67     0.3%
827                 65     0.3%
950                 65     0.3%
1125                65     0.3%
217                 64     0.3%
906                 64     0.3%
350                 64     0.3%
1157                64     0.3%
1146                64     0.3%
117                 64     0.3%
1469                64     0.3%
465                 64     0.3%
688                 64     0.3%
147                 64     0.3%
977                 64     0.3%
589                 64     0.3%
575                 64     0.3%
921                 64     0.3%
Other values (858)  21701  92.6%

Value  Count  Frequency (%)
102    16     0.1%
103    18     0.1%
104    16     0.1%
105    15     0.1%
106    16     0.1%
107    16     0.1%
109    15     0.1%
111    47     0.2%
115    16     0.1%
116    32     0.1%

Value  Count  Frequency (%)
1499   15     0.1%
1498   16     0.1%
1496   32     0.1%
1495   48     0.2%
1492   16     0.1%
1490   62     0.3%
1488   16     0.1%
1485   46     0.2%
1482   16     0.1%
1480   32     0.1%

Department
Categorical

Distinct count: 4
Unique (%): < 0.1%
Missing: 11
Missing (%): < 0.1%
Memory size: 183.2 KiB

Value                   Count  Frequency (%)
Research & Development  15286  65.2%
Sales                   7119   30.4%
Human Resources         1019   4.3%
1296                    1      < 0.1%
(Missing)               11     < 0.1%
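
The value "1296", which occurs once, looks out of place among the department names and may be a data-entry or column-alignment error. A minimal sketch of one way to flag such entries in a categorical column; `find_numeric_labels` is a hypothetical helper written for this example, not part of pandas-profiling:

```python
def find_numeric_labels(value_counts):
    """Return the labels that parse as numbers, with their counts.

    Hypothetical helper: a label that converts cleanly to float is
    treated as suspicious in a column expected to hold names.
    """
    suspicious = {}
    for label, count in value_counts.items():
        try:
            float(label)
        except ValueError:
            continue  # ordinary text label, not suspicious
        suspicious[label] = count
    return suspicious


# Counts copied from the Department frequency table (missing values omitted).
department_counts = {
    "Research & Development": 15286,
    "Sales": 7119,
    "Human Resources": 1019,
    "1296": 1,
}

print(find_numeric_labels(department_counts))
```

Note that `float` also accepts strings such as "nan" and "inf", so a stricter check may be preferable on columns where those can appear.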
 

Length

Max length: 22
Median length: 22
Mean length: 16.52197474
Min length: 3

Overview of Unicode Properties

Unique unicode characters: 24
Unique unicode categories: 5
Unique unicode scripts: 2
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e8558722.1%
 
315918.2%
 
s244436.3%
 
a234356.1%
 
l224055.8%
 
n163274.2%
 
R163054.2%
 
r163054.2%
 
c163054.2%
 
o163054.2%
 
m163054.2%
 
h152863.9%
 
&152863.9%
 
D152863.9%
 
v152863.9%
 
p152863.9%
 
t152863.9%
 
S71191.8%
 
u20380.5%
 
H10190.3%
 
11< 0.1%
 
21< 0.1%
 
91< 0.1%
 
61< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter30059977.6%
 
Uppercase Letter3972910.3%
 
Space Separator315918.2%
 
Other Punctuation152863.9%
 
Decimal Number4< 0.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
R1630541.0%
 
D1528638.5%
 
S711917.9%
 
H10192.6%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e8558728.5%
 
s244438.1%
 
a234357.8%
 
l224057.5%
 
n163275.4%
 
r163055.4%
 
c163055.4%
 
o163055.4%
 
m163055.4%
 
h152865.1%
 
v152865.1%
 
p152865.1%
 
t152865.1%
 
u20380.7%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
31591100.0%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
&15286100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
1125.0%
 
2125.0%
 
9125.0%
 
6125.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin34032887.9%
 
Common4688112.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e8558725.1%
 
s244437.2%
 
a234356.9%
 
l224056.6%
 
n163274.8%
 
R163054.8%
 
r163054.8%
 
c163054.8%
 
o163054.8%
 
m163054.8%
 
h152864.5%
 
D152864.5%
 
v152864.5%
 
p152864.5%
 
t152864.5%
 
S71192.1%
 
u20380.6%
 
H10190.3%
 

Most frequent Common characters

ValueCountFrequency (%) 
3159167.4%
 
&1528632.6%
 
11< 0.1%
 
21< 0.1%
 
91< 0.1%
 
61< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII387209100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e8558722.1%
 
315918.2%
 
s244436.3%
 
a234356.1%
 
l224055.8%
 
n163274.2%
 
R163054.2%
 
r163054.2%
 
c163054.2%
 
o163054.2%
 
m163054.2%
 
h152863.9%
 
&152863.9%
 
D152863.9%
 
v152863.9%
 
p152863.9%
 
t152863.9%
 
S71191.8%
 
u20380.5%
 
H10190.3%
 
11< 0.1%
 
21< 0.1%
 
91< 0.1%
 
61< 0.1%
 

DistanceFromHome
Unsupported

REJECTED
UNSUPPORTED

Missing: 9
Missing (%): < 0.1%
Memory size: 1.1 MiB

Education
Real number (ℝ≥0)

Distinct count: 6
Unique (%): < 0.1%
Missing: 12
Missing (%): 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.9100495218579234
Minimum: 1.0
Maximum: 6.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 183.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 2
median: 3
Q3: 4
95-th percentile: 4
Maximum: 6
Range: 5
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.024930959
Coefficient of variation (CV): 0.3522039579
Kurtosis: -0.5648828904
Mean: 2.910049522
Median Absolute Deviation (MAD): 1
Skewness: -0.2842605381
Sum: 68165
Variance: 1.050483471
Histogram with fixed size bins (bins=10)
Value      Count  Frequency (%)
3          9098   38.8%
4          6321   27.0%
2          4517   19.3%
1          2722   11.6%
5          765    3.3%
6          1      < 0.1%
(Missing)  12     0.1%

Value  Count  Frequency (%)
1      2722   11.6%
2      4517   19.3%
3      9098   38.8%
4      6321   27.0%
5      765    3.3%
6      1      < 0.1%

Value  Count  Frequency (%)
6      1      < 0.1%
5      765    3.3%
4      6321   27.0%
3      9098   38.8%
2      4517   19.3%
1      2722   11.6%

EducationField
Categorical

Distinct count: 8
Unique (%): < 0.1%
Missing: 9
Missing (%): < 0.1%
Memory size: 183.2 KiB

Value             Count  Frequency (%)
Life Sciences     9701   41.4%
Medical           7337   31.3%
Marketing         2541   10.8%
Technical Degree  2089   8.9%
Other             1311   5.6%
Human Resources   446    1.9%
Test              1      < 0.1%
3                 1      < 0.1%
(Missing)         9      < 0.1%

Length

Max length: 16
Median length: 13
Mean length: 10.5411333
Min length: 1

Overview of Unicode Properties

Unique unicode characters: 27
Unique unicode categories: 4
Unique unicode scripts: 2
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e4954120.1%
 
i3136912.7%
 
c3136312.7%
 
n147956.0%
 
a124225.0%
 
122365.0%
 
s105944.3%
 
M98784.0%
 
L97013.9%
 
f97013.9%
 
S97013.9%
 
l94263.8%
 
d73373.0%
 
r63872.6%
 
g46301.9%
 
t38531.6%
 
h34001.4%
 
k25411.0%
 
T20900.8%
 
D20890.8%
 
O13110.5%
 
u8920.4%
 
H4460.2%
 
m4460.2%
 
R4460.2%
 
Other values (2)4470.2%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter19914380.6%
 
Uppercase Letter3566214.4%
 
Space Separator122365.0%
 
Decimal Number1< 0.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
M987827.7%
 
L970127.2%
 
S970127.2%
 
T20905.9%
 
D20895.9%
 
O13113.7%
 
H4461.3%
 
R4461.3%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e4954124.9%
 
i3136915.8%
 
c3136315.7%
 
n147957.4%
 
a124226.2%
 
s105945.3%
 
f97014.9%
 
l94264.7%
 
d73373.7%
 
r63873.2%
 
g46302.3%
 
t38531.9%
 
h34001.7%
 
k25411.3%
 
u8920.4%
 
m4460.2%
 
o4460.2%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
12236100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
31100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin23480595.0%
 
Common122375.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e4954121.1%
 
i3136913.4%
 
c3136313.4%
 
n147956.3%
 
a124225.3%
 
s105944.5%
 
M98784.2%
 
L97014.1%
 
f97014.1%
 
S97014.1%
 
l94264.0%
 
d73373.1%
 
r63872.7%
 
g46302.0%
 
t38531.6%
 
h34001.4%
 
k25411.1%
 
T20900.9%
 
D20890.9%
 
O13110.6%
 
u8920.4%
 
H4460.2%
 
m4460.2%
 
R4460.2%
 
o4460.2%
 

Most frequent Common characters

ValueCountFrequency (%) 
12236> 99.9%
 
31< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII247042100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e4954120.1%
 
i3136912.7%
 
c3136312.7%
 
n147956.0%
 
a124225.0%
 
122365.0%
 
s105944.3%
 
M98784.0%
 
L97013.9%
 
f97013.9%
 
S97013.9%
 
l94263.8%
 
d73373.0%
 
r63872.6%
 
g46301.9%
 
t38531.6%
 
h34001.4%
 
k25411.0%
 
T20900.8%
 
D20890.8%
 
O13110.5%
 
u8920.4%
 
H4460.2%
 
m4460.2%
 
R4460.2%
 
Other values (2)4470.2%
 

EmployeeCount
Unsupported

REJECTED
UNSUPPORTED

Missing: 5
Missing (%): < 0.1%
Memory size: 1.1 MiB

EmployeeNumber
Categorical

HIGH CARDINALITY
UNIFORM

Distinct count: 23366
Unique (%): 99.7%
Missing: 1
Missing (%): < 0.1%
Memory size: 183.2 KiB

Value                 Count  Frequency (%)
23244                 7      < 0.1%
1                     7      < 0.1%
6325                  6      < 0.1%
10442                 5      < 0.1%
10024                 4      < 0.1%
9568                  4      < 0.1%
12686                 4      < 0.1%
35                    4      < 0.1%
12078                 4      < 0.1%
13411                 3      < 0.1%
17031                 3      < 0.1%
11470                 3      < 0.1%
816                   2      < 0.1%
2092                  2      < 0.1%
1826                  2      < 0.1%
7                     2      < 0.1%
3538                  2      < 0.1%
20308                 2      < 0.1%
4                     2      < 0.1%
126                   2      < 0.1%
819                   2      < 0.1%
2396                  2      < 0.1%
15848                 2      < 0.1%
21567                 2      < 0.1%
1294                  2      < 0.1%
Other values (23341)  23355  99.7%

Length

Max length: 7
Median length: 5
Mean length: 4.525942994
Min length: 1

Overview of Unicode Properties

Unique unicode characters: 21
Unique unicode categories: 3
Unique unicode scripts: 2
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
12005618.9%
 
21353112.8%
 
395779.0%
 
490408.5%
 
589938.5%
 
089918.5%
 
689908.5%
 
889658.5%
 
789628.4%
 
989478.4%
 
T5< 0.1%
 
E2< 0.1%
 
S2< 0.1%
 
n2< 0.1%
 
e1< 0.1%
 
s1< 0.1%
 
t1< 0.1%
 
I1< 0.1%
 
N1< 0.1%
 
G1< 0.1%
 
a1< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number106052> 99.9%
 
Uppercase Letter12< 0.1%
 
Lowercase Letter6< 0.1%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
12005618.9%
 
21353112.8%
 
395779.0%
 
490408.5%
 
589938.5%
 
089918.5%
 
689908.5%
 
889658.5%
 
789628.5%
 
989478.4%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
T541.7%
 
E216.7%
 
S216.7%
 
I18.3%
 
N18.3%
 
G18.3%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n233.3%
 
e116.7%
 
s116.7%
 
t116.7%
 
a116.7%
 

Most occurring scripts

ValueCountFrequency (%) 
Common106052> 99.9%
 
Latin18< 0.1%
 

Most frequent Common characters

ValueCountFrequency (%) 
12005618.9%
 
21353112.8%
 
395779.0%
 
490408.5%
 
589938.5%
 
089918.5%
 
689908.5%
 
889658.5%
 
789628.5%
 
989478.4%
 

Most frequent Latin characters

ValueCountFrequency (%) 
T527.8%
 
E211.1%
 
S211.1%
 
n211.1%
 
e15.6%
 
s15.6%
 
t15.6%
 
I15.6%
 
N15.6%
 
G15.6%
 
a15.6%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII106070100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
12005618.9%
 
21353112.8%
 
395779.0%
 
490408.5%
 
589938.5%
 
089918.5%
 
689908.5%
 
889658.5%
 
789628.4%
 
989478.4%
 
T5< 0.1%
 
E2< 0.1%
 
S2< 0.1%
 
n2< 0.1%
 
e1< 0.1%
 
s1< 0.1%
 
t1< 0.1%
 
I1< 0.1%
 
N1< 0.1%
 
G1< 0.1%
 
a1< 0.1%
 

Application ID
Unsupported

REJECTED
UNSUPPORTED

Missing: 3
Missing (%): < 0.1%
Memory size: 1.2 MiB

EnvironmentSatisfaction
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED

Distinct count: 6
Unique (%): < 0.1%
Missing: 9
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 13.681777436291458
Minimum: 1.0
Maximum: 129588.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 183.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 2
median: 3
Q3: 4
95-th percentile: 4
Maximum: 129588
Range: 129587
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1186.544372
Coefficient of variation (CV): 86.72443166
Kurtosis: 11714.8661
Mean: 13.68177744
Median Absolute Deviation (MAD): 1
Skewness: 108.2353204
Sum: 320523
Variance: 1407887.547
Histogram with fixed size bins (bins=10)
Value      Count  Frequency (%)
3          7196   30.7%
4          7110   30.3%
1          4580   19.5%
2          4539   19.4%
129588     1      < 0.1%
127249     1      < 0.1%
(Missing)  9      < 0.1%

Value   Count  Frequency (%)
1       4580   19.5%
2       4539   19.4%
3       7196   30.7%
4       7110   30.3%
127249  1      < 0.1%
129588  1      < 0.1%

Value   Count  Frequency (%)
129588  1      < 0.1%
127249  1      < 0.1%
4       7110   30.3%
3       7196   30.7%
2       4539   19.4%
1       4580   19.5%
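
The extreme skewness reported for this variable appears to be driven entirely by the two six-digit values, each of which occurs once. The sketch below recomputes the skewness from the frequency table with and without those two rows. It assumes the report uses the bias-corrected (adjusted Fisher-Pearson) estimator, as pandas' `Series.skew` does; the function names are illustrative only.

```python
import math

# Value -> count, copied from the EnvironmentSatisfaction frequency table.
counts = {1: 4580, 2: 4539, 3: 7196, 4: 7110, 127249: 1, 129588: 1}


def central_moments(counts):
    """Return n and the second and third central moments of the weighted data."""
    n = sum(counts.values())
    mean = sum(v * c for v, c in counts.items()) / n
    m2 = sum(c * (v - mean) ** 2 for v, c in counts.items()) / n
    m3 = sum(c * (v - mean) ** 3 for v, c in counts.items()) / n
    return n, m2, m3


def sample_skewness(counts):
    """Adjusted Fisher-Pearson skewness (assumed to be the estimator used)."""
    n, m2, m3 = central_moments(counts)
    return math.sqrt(n * (n - 1)) / (n - 2) * m3 / m2 ** 1.5


skew_all = sample_skewness(counts)
# Keep only the values on the apparent 1-4 rating scale.
in_scale = {v: c for v, c in counts.items() if v <= 4}
skew_in_scale = sample_skewness(in_scale)

print(f"all values:  {skew_all:.4f}")
print(f"values 1-4:  {skew_in_scale:.4f}")
```

With the two outliers removed, the skewness drops to a small negative value, which suggests the warning reflects data-quality problems rather than the shape of the rating distribution.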
 

Gender
Categorical

HIGH CORRELATION

Distinct count: 4
Unique (%): < 0.1%
Missing: 10
Missing (%): < 0.1%
Memory size: 183.2 KiB

Value      Count  Frequency (%)
Male       14056  60.0%
Female     9368   40.0%
2          1      < 0.1%
1          1      < 0.1%
(Missing)  10     < 0.1%

Length

Max length: 6
Median length: 4
Mean length: 4.798771121
Min length: 1

Overview of Unicode Properties

Unique unicode characters: 9
Unique unicode categories: 3
Unique unicode scripts: 2
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e3279229.2%
 
a2343420.8%
 
l2342420.8%
 
M1405612.5%
 
F93688.3%
 
m93688.3%
 
n20< 0.1%
 
11< 0.1%
 
21< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter8903879.2%
 
Uppercase Letter2342420.8%
 
Decimal Number2< 0.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
M1405660.0%
 
F936840.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e3279236.8%
 
a2343426.3%
 
l2342426.3%
 
m936810.5%
 
n20< 0.1%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
1150.0%
 
2150.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin112462> 99.9%
 
Common2< 0.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e3279229.2%
 
a2343420.8%
 
l2342420.8%
 
M1405612.5%
 
F93688.3%
 
m93688.3%
 
n20< 0.1%
 

Most frequent Common characters

ValueCountFrequency (%) 
1150.0%
 
2150.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII112464100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e3279229.2%
 
a2343420.8%
 
l2342420.8%
 
M1405612.5%
 
F93688.3%
 
m93688.3%
 
n20< 0.1%
 
11< 0.1%
 
21< 0.1%
 

HourlyRate
Categorical

HIGH CARDINALITY
HIGH CORRELATION

Distinct count: 73
Unique (%): 0.3%
Missing: 9
Missing (%): < 0.1%
Memory size: 183.2 KiB

Value              Count  Frequency (%)
66                 480    2.0%
98                 447    1.9%
48                 447    1.9%
42                 445    1.9%
84                 444    1.9%
96                 436    1.9%
79                 431    1.8%
87                 417    1.8%
57                 416    1.8%
56                 410    1.7%
52                 409    1.7%
54                 406    1.7%
32                 404    1.7%
72                 404    1.7%
46                 400    1.7%
92                 397    1.7%
43                 383    1.6%
73                 383    1.6%
45                 379    1.6%
82                 377    1.6%
78                 371    1.6%
81                 366    1.6%
95                 365    1.6%
62                 363    1.5%
60                 363    1.5%
Other values (48)  13284  56.7%

Length

Max length: 6
Median length: 2
Mean length: 2.013227513
Min length: 2
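
A maximum length of 6 in a column of two-digit rates suggests that a few non-numeric strings are present, which would explain why HourlyRate is typed as Categorical rather than numeric. The character counts in this section are consistent with one "Male", one "Female", and nine "nan" entries, though that is an inference, not something the report states. A minimal sketch of separating parseable rates from stray strings; `split_numeric` and the sample list are hypothetical:

```python
def split_numeric(values):
    """Partition raw strings into parsed integers and rejected strings.

    Hypothetical helper for illustration; not part of pandas-profiling.
    """
    numbers, rejected = [], []
    for raw in values:
        try:
            numbers.append(int(raw))
        except ValueError:
            rejected.append(raw)  # e.g. a label from a neighbouring column
    return numbers, rejected


# Made-up sample mixing rates with the kind of strings inferred above.
sample = ["66", "98", "Male", "48", "nan", "Female", "42"]
numbers, rejected = split_numeric(sample)

print(numbers)
print(rejected)
```

Inspecting the rejected values before coercing the column to a numeric type helps show whether they are isolated typos or evidence of shifted columns.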

Overview of Unicode Properties

Unique unicode characters: 17
Unique unicode categories: 3
Unique unicode scripts: 2
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
6579812.3%
 
4578912.3%
 
8576012.2%
 
7574412.2%
 
9564012.0%
 
5550211.7%
 
3495810.5%
 
227995.9%
 
026985.7%
 
124575.2%
 
n18< 0.1%
 
a11< 0.1%
 
e3< 0.1%
 
l2< 0.1%
 
M1< 0.1%
 
F1< 0.1%
 
m1< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number4714599.9%
 
Lowercase Letter350.1%
 
Uppercase Letter2< 0.1%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
6579812.3%
 
4578912.3%
 
8576012.2%
 
7574412.2%
 
9564012.0%
 
5550211.7%
 
3495810.5%
 
227995.9%
 
026985.7%
 
124575.2%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n1851.4%
 
a1131.4%
 
e38.6%
 
l25.7%
 
m12.9%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
M150.0%
 
F150.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Common4714599.9%
 
Latin370.1%
 

Most frequent Common characters

ValueCountFrequency (%) 
6579812.3%
 
4578912.3%
 
8576012.2%
 
7574412.2%
 
9564012.0%
 
5550211.7%
 
3495810.5%
 
227995.9%
 
026985.7%
 
124575.2%
 

Most frequent Latin characters

ValueCountFrequency (%) 
n1848.6%
 
a1129.7%
 
e38.1%
 
l25.4%
 
M12.7%
 
F12.7%
 
m12.7%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII47182100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
6579812.3%
 
4578912.3%
 
8576012.2%
 
7574412.2%
 
9564012.0%
 
5550211.7%
 
3495810.5%
 
227995.9%
 
026985.7%
 
124575.2%
 
n18< 0.1%
 
a11< 0.1%
 
e3< 0.1%
 
l2< 0.1%
 
M1< 0.1%
 
F1< 0.1%
 
m1< 0.1%
 

JobInvolvement
Real number (ℝ≥0)

Distinct count: 6
Unique (%): < 0.1%
Missing: 9
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.7338114141802192
Minimum: 1.0
Maximum: 54.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 183.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 2
median: 3
Q3: 3
95-th percentile: 4
Maximum: 54
Range: 53
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 0.8368602102
Coefficient of variation (CV): 0.3061148278
Kurtosis: 934.3160899
Mean: 2.733811414
Median Absolute Deviation (MAD): 0
Skewness: 15.81635307
Sum: 64045
Variance: 0.7003350115
Histogram with fixed size bins (bins=10)
Value      Count  Frequency (%)
3          13853  59.1%
2          5973   25.5%
4          2280   9.7%
1          1319   5.6%
47         1      < 0.1%
54         1      < 0.1%
(Missing)  9      < 0.1%

Value  Count  Frequency (%)
1      1319   5.6%
2      5973   25.5%
3      13853  59.1%
4      2280   9.7%
47     1      < 0.1%
54     1      < 0.1%

Value  Count  Frequency (%)
54     1      < 0.1%
47     1      < 0.1%
4      2280   9.7%
3      13853  59.1%
2      5973   25.5%
1      1319   5.6%

JobLevel
Real number (ℝ≥0)

Distinct count: 5
Unique (%): < 0.1%
Missing: 7
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.064023219087456
Minimum: 1.0
Maximum: 5.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 183.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
median: 2
Q3: 3
95-th percentile: 4
Maximum: 5
Range: 4
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.105420801
Coefficient of variation (CV): 0.5355660684
Kurtosis: 0.3934487693
Mean: 2.064023219
Median Absolute Deviation (MAD): 1
Skewness: 1.023071
Sum: 48358
Variance: 1.221955146
Histogram with fixed size bins (bins=10)
Value      Count  Frequency (%)
1          8641   36.9%
2          8526   36.4%
3          3475   14.8%
4          1695   7.2%
5          1092   4.7%
(Missing)  7      < 0.1%

Value  Count  Frequency (%)
1      8641   36.9%
2      8526   36.4%
3      3475   14.8%
4      1695   7.2%
5      1092   4.7%

Value  Count  Frequency (%)
5      1092   4.7%
4      1695   7.2%
3      3475   14.8%
2      8526   36.4%
1      8641   36.9%

JobRole
Categorical

HIGH CORRELATION

Distinct count: 11
Unique (%): < 0.1%
Missing: 9
Missing (%): < 0.1%
Memory size: 183.2 KiB

Value                      Count  Frequency (%)
Sales Executive            5111   21.8%
Research Scientist         4634   19.8%
Laboratory Technician      4162   17.8%
Manufacturing Director     2376   10.1%
Healthcare Representative  2104   9.0%
Manager                    1600   6.8%
Sales Representative       1306   5.6%
Research Director          1287   5.5%
Human Resources            845    3.6%
4                          1      < 0.1%
5                          1      < 0.1%
(Missing)                  9      < 0.1%

Length

Max length: 25
Median length: 18
Mean length: 18.10266257
Min length: 1

Overview of Unicode Properties

Unique unicode characters: 31
Unique unicode categories: 4
Unique unicode scripts: 2
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e6207814.6%
 
a412489.7%
 
t335047.9%
 
c329787.8%
 
i321527.6%
 
r319067.5%
 
n235835.6%
 
s220725.2%
 
218255.1%
 
o128323.0%
 
h121872.9%
 
u115532.7%
 
S110512.6%
 
R101762.4%
 
l85212.0%
 
v85212.0%
 
E51111.2%
 
x51111.2%
 
L41621.0%
 
b41621.0%
 
y41621.0%
 
T41621.0%
 
M39760.9%
 
g39760.9%
 
D36630.9%
 
Other values (6)95822.3%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter35717784.2%
 
Uppercase Letter4525010.7%
 
Space Separator218255.1%
 
Decimal Number2< 0.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
S1105124.4%
 
R1017622.5%
 
E511111.3%
 
L41629.2%
 
T41629.2%
 
M39768.8%
 
D36638.1%
 
H29496.5%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e6207817.4%
 
a4124811.5%
 
t335049.4%
 
c329789.2%
 
i321529.0%
 
r319068.9%
 
n235836.6%
 
s220726.2%
 
o128323.6%
 
h121873.4%
 
u115533.2%
 
l85212.4%
 
v85212.4%
 
x51111.4%
 
b41621.2%
 
y41621.2%
 
g39761.1%
 
p34101.0%
 
f23760.7%
 
m8450.2%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
21825100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
5150.0%
 
4150.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin40242794.9%
 
Common218275.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e6207815.4%
 
a4124810.2%
 
t335048.3%
 
c329788.2%
 
i321528.0%
 
r319067.9%
 
n235835.9%
 
s220725.5%
 
o128323.2%
 
h121873.0%
 
u115532.9%
 
S110512.7%
 
R101762.5%
 
l85212.1%
 
v85212.1%
 
E51111.3%
 
x51111.3%
 
L41621.0%
 
b41621.0%
 
y41621.0%
 
T41621.0%
 
M39761.0%
 
g39761.0%
 
D36630.9%
 
p34100.8%
 
Other values (3)61701.5%
 

Most frequent Common characters

ValueCountFrequency (%) 
21825> 99.9%
 
51< 0.1%
 
41< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII424254100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e6207814.6%
 
a412489.7%
 
t335047.9%
 
c329787.8%
 
i321527.6%
 
r319067.5%
 
n235835.6%
 
s220725.2%
 
218255.1%
 
o128323.0%
 
h121872.9%
 
u115532.7%
 
S110512.6%
 
R101762.4%
 
l85212.0%
 
v85212.0%
 
E51111.2%
 
x51111.2%
 
L41621.0%
 
b41621.0%
 
y41621.0%
 
T41621.0%
 
M39760.9%
 
g39760.9%
 
D36630.9%
 
Other values (6)95822.3%
 

JobSatisfaction
Categorical

HIGH CORRELATION

Distinct count: 5
Unique (%): < 0.1%
Missing: 9
Missing (%): < 0.1%
Memory size: 183.2 KiB

Value      Count  Frequency (%)
4          7276   31.0%
3          7088   30.2%
1          4605   19.6%
2          4456   19.0%
Manager    2      < 0.1%
(Missing)  9      < 0.1%

Length

Max length: 7
Median length: 1
Mean length: 1.001280082
Min length: 1

Overview of Unicode Properties

Unique unicode characters: 10
Unique unicode categories: 3
Unique unicode scripts: 2
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
4727631.0%
 
3708830.2%
 
1460519.6%
 
2445619.0%
 
n200.1%
 
a130.1%
 
M2< 0.1%
 
g2< 0.1%
 
e2< 0.1%
 
r2< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number2342599.8%
 
Lowercase Letter390.2%
 
Uppercase Letter2< 0.1%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
4727631.1%
 
3708830.3%
 
1460519.7%
 
2445619.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n2051.3%
 
a1333.3%
 
g25.1%
 
e25.1%
 
r25.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
M2100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Common2342599.8%
 
Latin410.2%
 

Most frequent Common characters

ValueCountFrequency (%) 
4727631.1%
 
3708830.3%
 
1460519.7%
 
2445619.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
n2048.8%
 
a1331.7%
 
M24.9%
 
g24.9%
 
e24.9%
 
r24.9%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII23466100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
4727631.0%
 
3708830.2%
 
1460519.6%
 
2445619.0%
 
n200.1%
 
a130.1%
 
M2< 0.1%
 
g2< 0.1%
 
e2< 0.1%
 
r2< 0.1%
 

MaritalStatus
Categorical

HIGH CORRELATION

Distinct count: 4
Unique (%): < 0.1%
Missing: 11
Missing (%): < 0.1%
Memory size: 183.2 KiB

Value      Count  Frequency (%)
Married    10709  45.7%
Single     7504   32.0%
Divorced   5210   22.2%
4          2      < 0.1%
(Missing)  11     < 0.1%

Length

Max length: 8
Median length: 7
Mean length: 6.899726916
Min length: 1

Overview of Unicode Properties

Unique unicode characters: 15
Unique unicode categories: 3
Unique unicode scripts: 2
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
r2662816.5%
 
i2342314.5%
 
e2342314.5%
 
d159199.8%
 
a107206.6%
 
M107096.6%
 
n75264.7%
 
S75044.6%
 
g75044.6%
 
l75044.6%
 
D52103.2%
 
v52103.2%
 
o52103.2%
 
c52103.2%
 
42< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter13827785.5%
 
Uppercase Letter2342314.5%
 
Decimal Number2< 0.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
M1070945.7%
 
S750432.0%
 
D521022.2%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
r2662819.3%
 
i2342316.9%
 
e2342316.9%
 
d1591911.5%
 
a107207.8%
 
n75265.4%
 
g75045.4%
 
l75045.4%
 
v52103.8%
 
o52103.8%
 
c52103.8%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
42100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin161700> 99.9%
 
Common2< 0.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
r2662816.5%
 
i2342314.5%
 
e2342314.5%
 
d159199.8%
 
a107206.6%
 
M107096.6%
 
n75264.7%
 
S75044.6%
 
g75044.6%
 
l75044.6%
 
D52103.2%
 
v52103.2%
 
o52103.2%
 
c52103.2%
 

Most frequent Common characters

ValueCountFrequency (%) 
42100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII161702100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
r2662816.5%
 
i2342314.5%
 
e2342314.5%
 
d159199.8%
 
a107206.6%
 
M107096.6%
 
n75264.7%
 
S75044.6%
 
g75044.6%
 
l75044.6%
 
D52103.2%
 
v52103.2%
 
o52103.2%
 
c52103.2%
 
42< 0.1%
 

MonthlyIncome
Categorical

HIGH CARDINALITY

Distinct count: 1351
Unique (%): 5.8%
Missing: 13
Missing (%): 0.1%
Memory size: 183.2 KiB

Value                  Count    Frequency (%)
2342                   66       0.3%
2559                   54       0.2%
2380                   49       0.2%
5562                   48       0.2%
2610                   48       0.2%
6347                   48       0.2%
2741                   48       0.2%
6142                   47       0.2%
2451                   47       0.2%
3452                   46       0.2%
2404                   46       0.2%
2693                   45       0.2%
7756                   41       0.2%
2293                   39       0.2%
5204                   38       0.2%
3162                   38       0.2%
5993                   38       0.2%
4898                   38       0.2%
9980                   37       0.2%
2911                   37       0.2%
2028                   37       0.2%
10096                  34       0.1%
2044                   33       0.1%
5405                   33       0.1%
3294                   33       0.1%
Other values (1326)    22355    95.4%
 

Length

Max length7
Median length4
Mean length4.190305513
Min length3

Overview of Unicode Properties

Unique unicode characters20
Unique unicode categories (?)3
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
21321513.5%
 
11171511.9%
 
41100011.2%
 
31028810.5%
 
695259.7%
 
593489.5%
 
786888.8%
 
984368.6%
 
082358.4%
 
877027.8%
 
n27< 0.1%
 
a14< 0.1%
 
r2< 0.1%
 
i2< 0.1%
 
e2< 0.1%
 
M1< 0.1%
 
d1< 0.1%
 
S1< 0.1%
 
g1< 0.1%
 
l1< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number9815299.9%
 
Lowercase Letter500.1%
 
Uppercase Letter2< 0.1%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
21321513.5%
 
11171511.9%
 
41100011.2%
 
31028810.5%
 
695259.7%
 
593489.5%
 
786888.9%
 
984368.6%
 
082358.4%
 
877027.8%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n2754.0%
 
a1428.0%
 
r24.0%
 
i24.0%
 
e24.0%
 
d12.0%
 
g12.0%
 
l12.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
M150.0%
 
S150.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Common9815299.9%
 
Latin520.1%
 

Most frequent Common characters

ValueCountFrequency (%) 
21321513.5%
 
11171511.9%
 
41100011.2%
 
31028810.5%
 
695259.7%
 
593489.5%
 
786888.9%
 
984368.6%
 
082358.4%
 
877027.8%
 

Most frequent Latin characters

ValueCountFrequency (%) 
n2751.9%
 
a1426.9%
 
r23.8%
 
i23.8%
 
e23.8%
 
M11.9%
 
d11.9%
 
S11.9%
 
g11.9%
 
l11.9%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII98204100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
21321513.5%
 
11171511.9%
 
41100011.2%
 
31028810.5%
 
695259.7%
 
593489.5%
 
786888.8%
 
984368.6%
 
082358.4%
 
877027.8%
 
n27< 0.1%
 
a14< 0.1%
 
r2< 0.1%
 
i2< 0.1%
 
e2< 0.1%
 
M1< 0.1%
 
d1< 0.1%
 
S1< 0.1%
 
g1< 0.1%
 
l1< 0.1%
 

MonthlyRate
Real number (ℝ≥0)

Distinct count: 1429
Unique (%): 6.1%
Missing: 11
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 14304.343820704376
Minimum: 2094.0
Maximum: 26999.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 183.2 KiB

Quantile statistics

Minimum: 2094
5-th percentile: 3395
Q1: 8053
Median: 14222
Q3: 20460
95-th percentile: 25412
Maximum: 26999
Range: 24905
Interquartile range (IQR): 12407

Descriptive statistics

Standard deviation: 7102.636293
Coefficient of variation (CV): 0.4965370227
Kurtosis: -1.214860841
Mean: 14304.34382
Median Absolute Deviation (MAD): 6204
Skewness: 0.0194988031
Sum: 335079254
Variance: 50447442.31
Histogram with fixed size bins (bins=10)
Value                  Count    Frequency (%)
9150                   53       0.2%
4223                   49       0.2%
19373                  34       0.1%
21534                  33       0.1%
24444                  32       0.1%
21981                  32       0.1%
16154                  32       0.1%
11652                  32       0.1%
8952                   32       0.1%
6319                   32       0.1%
12858                  32       0.1%
2755                   32       0.1%
23016                  32       0.1%
9129                   32       0.1%
5355                   32       0.1%
11737                  32       0.1%
6670                   32       0.1%
7744                   32       0.1%
3339                   32       0.1%
7324                   32       0.1%
15891                  32       0.1%
2125                   32       0.1%
15986                  32       0.1%
11591                  32       0.1%
9096                   32       0.1%
Other values (1404)    22584    96.4%

Value    Count    Frequency (%)
2094     16       0.1%
2097     15       0.1%
2104     16       0.1%
2112     15       0.1%
2122     16       0.1%
2125     32       0.1%
2137     16       0.1%
2227     16       0.1%
2243     16       0.1%
2253     16       0.1%

Value    Count    Frequency (%)
26999    16       0.1%
26997    14       0.1%
26968    16       0.1%
26959    15       0.1%
26956    16       0.1%
26933    16       0.1%
26914    16       0.1%
26897    15       0.1%
26894    15       0.1%
26862    16       0.1%
 

NumCompaniesWorked
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct count: 12
Unique (%): 0.1%
Missing: 9
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.887779058351475
Minimum: 0.0
Maximum: 23258.0
Zeros: 3176
Zeros (%): 13.6%
Memory size: 183.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1
Median: 2
Q3: 4
95-th percentile: 8
Maximum: 23258
Range: 23258
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 155.3329041
Coefficient of variation (CV): 39.95414909
Kurtosis: 21486.8869
Mean: 3.887779058
Median Absolute Deviation (MAD): 1
Skewness: 144.5986932
Sum: 91079
Variance: 24128.31111
Histogram with fixed size bins (bins=10)
Value        Count    Frequency (%)
1            8311     35.5%
0            3176     13.6%
3            2508     10.7%
2            2330     9.9%
4            2208     9.4%
7            1171     5.0%
6            1108     4.7%
5            1002     4.3%
9            818      3.5%
8            793      3.4%
23258        1        < 0.1%
4933         1        < 0.1%
(Missing)    9        < 0.1%

Value    Count    Frequency (%)
0        3176     13.6%
1        8311     35.5%
2        2330     9.9%
3        2508     10.7%
4        2208     9.4%
5        1002     4.3%
6        1108     4.7%
7        1171     5.0%
8        793      3.4%
9        818      3.5%

Value    Count    Frequency (%)
23258    1        < 0.1%
4933     1        < 0.1%
9        818      3.5%
8        793      3.4%
7        1171     5.0%
6        1108     4.7%
5        1002     4.3%
4        2208     9.4%
3        2508     10.7%
2        2330     9.9%
 

Over18
Categorical

HIGH CORRELATION

Distinct count: 2
Unique (%): < 0.1%
Missing: 10
Missing (%): < 0.1%
Memory size: 183.2 KiB

Value        Count    Frequency (%)
Y            23424    99.9%
1            2        < 0.1%
(Missing)    10       < 0.1%
 

Length

Max length3
Median length1
Mean length1.000853388
Min length1

Overview of Unicode Properties

Unique unicode characters4
Unique unicode categories (?)3
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
Y2342499.9%
 
n200.1%
 
a10< 0.1%
 
12< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Uppercase Letter2342499.9%
 
Lowercase Letter300.1%
 
Decimal Number2< 0.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
Y23424100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n2066.7%
 
a1033.3%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
12100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin23454> 99.9%
 
Common2< 0.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
Y2342499.9%
 
n200.1%
 
a10< 0.1%
 

Most frequent Common characters

ValueCountFrequency (%) 
12100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII23456100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
Y2342499.9%
 
n200.1%
 
a10< 0.1%
 
12< 0.1%
 

OverTime
Categorical

HIGH CORRELATION

Distinct count: 3
Unique (%): < 0.1%
Missing: 12
Missing (%): 0.1%
Memory size: 183.2 KiB

Value        Count    Frequency (%)
No           16790    71.6%
Yes          6632     28.3%
Y            2        < 0.1%
(Missing)    12       0.1%
 

Length

Max length3
Median length2
Mean length2.283410138
Min length1

Overview of Unicode Properties

Unique unicode characters7
Unique unicode categories (?)2
Unique unicode scripts (?)1
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N1679031.4%
 
o1679031.4%
 
Y663412.4%
 
e663212.4%
 
s663212.4%
 
n24< 0.1%
 
a12< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter3009056.2%
 
Uppercase Letter2342443.8%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N1679071.7%
 
Y663428.3%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o1679055.8%
 
e663222.0%
 
s663222.0%
 
n240.1%
 
a12< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin53514100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N1679031.4%
 
o1679031.4%
 
Y663412.4%
 
e663212.4%
 
s663212.4%
 
n24< 0.1%
 
a12< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII53514100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N1679031.4%
 
o1679031.4%
 
Y663412.4%
 
e663212.4%
 
s663212.4%
 
n24< 0.1%
 
a12< 0.1%
 

PercentSalaryHike
Categorical

HIGH CORRELATION

Distinct count: 17
Unique (%): 0.1%
Missing: 14
Missing (%): 0.1%
Memory size: 183.2 KiB

Value        Count    Frequency (%)
11           3353     14.3%
13           3345     14.3%
14           3216     13.7%
12           3125     13.3%
15           1596     6.8%
18           1408     6.0%
17           1312     5.6%
16           1247     5.3%
19           1214     5.2%
22           889      3.8%
20           880      3.8%
21           767      3.3%
23           445      1.9%
24           340      1.5%
25           283      1.2%
No           1        < 0.1%
Yes          1        < 0.1%
(Missing)    14       0.1%
 

Length

Max length3
Median length2
Mean length2.000640041
Min length2

Overview of Unicode Properties

Unique unicode characters17
Unique unicode categories (?)3
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
12393651.1%
 
2761816.2%
 
337908.1%
 
435567.6%
 
518794.0%
 
814083.0%
 
713122.8%
 
612472.7%
 
912142.6%
 
08801.9%
 
n280.1%
 
a14< 0.1%
 
N1< 0.1%
 
o1< 0.1%
 
Y1< 0.1%
 
e1< 0.1%
 
s1< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number4684099.9%
 
Lowercase Letter450.1%
 
Uppercase Letter2< 0.1%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
12393651.1%
 
2761816.3%
 
337908.1%
 
435567.6%
 
518794.0%
 
814083.0%
 
713122.8%
 
612472.7%
 
912142.6%
 
08801.9%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n2862.2%
 
a1431.1%
 
o12.2%
 
e12.2%
 
s12.2%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N150.0%
 
Y150.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Common4684099.9%
 
Latin470.1%
 

Most frequent Common characters

ValueCountFrequency (%) 
12393651.1%
 
2761816.3%
 
337908.1%
 
435567.6%
 
518794.0%
 
814083.0%
 
713122.8%
 
612472.7%
 
912142.6%
 
08801.9%
 

Most frequent Latin characters

ValueCountFrequency (%) 
n2859.6%
 
a1429.8%
 
N12.1%
 
o12.1%
 
Y12.1%
 
e12.1%
 
s12.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII46887100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
12393651.1%
 
2761816.2%
 
337908.1%
 
435567.6%
 
518794.0%
 
814083.0%
 
713122.8%
 
612472.7%
 
912142.6%
 
08801.9%
 
n280.1%
 
a14< 0.1%
 
N1< 0.1%
 
o1< 0.1%
 
Y1< 0.1%
 
e1< 0.1%
 
s1< 0.1%
 

PerformanceRating
Categorical

HIGH CORRELATION

Distinct count: 4
Unique (%): < 0.1%
Missing: 10
Missing (%): < 0.1%
Memory size: 183.2 KiB

Value        Count    Frequency (%)
3            19791    84.4%
4            3633     15.5%
11           1        < 0.1%
13           1        < 0.1%
(Missing)    10       < 0.1%
 

Length

Max length4
Median length3
Mean length3.000085339
Min length3

Overview of Unicode Properties

Unique unicode characters7
Unique unicode categories (?)3
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
.2342633.3%
 
02342633.3%
 
31979228.1%
 
436335.2%
 
n20< 0.1%
 
a10< 0.1%
 
13< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number4685466.6%
 
Other Punctuation2342633.3%
 
Lowercase Letter30< 0.1%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
02342650.0%
 
31979242.2%
 
436337.8%
 
13< 0.1%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
.23426100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n2066.7%
 
a1033.3%
 

Most occurring scripts

ValueCountFrequency (%) 
Common70280> 99.9%
 
Latin30< 0.1%
 

Most frequent Common characters

ValueCountFrequency (%) 
.2342633.3%
 
02342633.3%
 
31979228.2%
 
436335.2%
 
13< 0.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
n2066.7%
 
a1033.3%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII70310100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
.2342633.3%
 
02342633.3%
 
31979228.1%
 
436335.2%
 
n20< 0.1%
 
a10< 0.1%
 
13< 0.1%
 

RelationshipSatisfaction
Categorical

Distinct count: 4
Unique (%): < 0.1%
Missing: 8
Missing (%): < 0.1%
Memory size: 183.2 KiB

Value        Count    Frequency (%)
3            7316     31.2%
4            6888     29.4%
2            4844     20.7%
1            4380     18.7%
(Missing)    8        < 0.1%
 

Length

Max length3
Median length3
Mean length3
Min length3

Overview of Unicode Properties

Unique unicode characters8
Unique unicode categories (?)3
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
.2342833.3%
 
02342833.3%
 
3731610.4%
 
468889.8%
 
248446.9%
 
143806.2%
 
n16< 0.1%
 
a8< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number4685666.6%
 
Other Punctuation2342833.3%
 
Lowercase Letter24< 0.1%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
02342850.0%
 
3731615.6%
 
4688814.7%
 
2484410.3%
 
143809.3%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
.23428100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n1666.7%
 
a833.3%
 

Most occurring scripts

ValueCountFrequency (%) 
Common70284> 99.9%
 
Latin24< 0.1%
 

Most frequent Common characters

ValueCountFrequency (%) 
.2342833.3%
 
02342833.3%
 
3731610.4%
 
468889.8%
 
248446.9%
 
143806.2%
 

Most frequent Latin characters

ValueCountFrequency (%) 
n1666.7%
 
a833.3%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII70308100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
.2342833.3%
 
02342833.3%
 
3731610.4%
 
468889.8%
 
248446.9%
 
143806.2%
 
n16< 0.1%
 
a8< 0.1%
 

StandardHours
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct count: 3
Unique (%): < 0.1%
Missing: 10
Missing (%): < 0.1%
Memory size: 183.2 KiB

Value        Count    Frequency (%)
80           23424    99.9%
3            1        < 0.1%
4            1        < 0.1%
(Missing)    10       < 0.1%
 

Length

Max length4
Median length4
Mean length3.999487967
Min length3

Overview of Unicode Properties

Unique unicode characters7
Unique unicode categories (?)3
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
04685050.0%
 
.2342625.0%
 
82342425.0%
 
n20< 0.1%
 
a10< 0.1%
 
41< 0.1%
 
31< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number7027675.0%
 
Other Punctuation2342625.0%
 
Lowercase Letter30< 0.1%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
04685066.7%
 
82342433.3%
 
41< 0.1%
 
31< 0.1%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
.23426100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n2066.7%
 
a1033.3%
 

Most occurring scripts

ValueCountFrequency (%) 
Common93702> 99.9%
 
Latin30< 0.1%
 

Most frequent Common characters

ValueCountFrequency (%) 
04685050.0%
 
.2342625.0%
 
82342425.0%
 
41< 0.1%
 
31< 0.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
n2066.7%
 
a1033.3%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII93732100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
04685050.0%
 
.2342625.0%
 
82342425.0%
 
n20< 0.1%
 
a10< 0.1%
 
41< 0.1%
 
31< 0.1%
 

StockOptionLevel
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct count: 5
Unique (%): < 0.1%
Missing: 9
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.7998463311563581
Minimum: 0.0
Maximum: 80.0
Zeros: 10066
Zeros (%): 43.0%
Memory size: 183.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 1
Q3: 1
95-th percentile: 3
Maximum: 80
Range: 80
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 1.122453851
Coefficient of variation (CV): 1.403336876
Kurtosis: 2114.866453
Mean: 0.7998463312
Median Absolute Deviation (MAD): 1
Skewness: 30.4052657
Sum: 18738
Variance: 1.259902649
Histogram with fixed size bins (bins=10)
Value        Count    Frequency (%)
0            10066    43.0%
1            9483     40.5%
2            2533     10.8%
3            1343     5.7%
80           2        < 0.1%
(Missing)    9        < 0.1%

Value    Count    Frequency (%)
0        10066    43.0%
1        9483     40.5%
2        2533     10.8%
3        1343     5.7%
80       2        < 0.1%

Value    Count    Frequency (%)
80       2        < 0.1%
3        1343     5.7%
2        2533     10.8%
1        9483     40.5%
0        10066    43.0%
 

TotalWorkingYears
Real number (ℝ≥0)

Distinct count: 40
Unique (%): 0.2%
Missing: 8
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 11.259219737066758
Minimum: 0.0
Maximum: 40.0
Zeros: 211
Zeros (%): 0.9%
Memory size: 183.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 6
Median: 10
Q3: 15
95-th percentile: 28
Maximum: 40
Range: 40
Interquartile range (IQR): 9

Descriptive statistics

Standard deviation: 7.772369689
Coefficient of variation (CV): 0.6903115732
Kurtosis: 0.9267695682
Mean: 11.25921974
Median Absolute Deviation (MAD): 4
Skewness: 1.117251569
Sum: 263781
Variance: 60.40973059
Histogram with fixed size bins (bins=10)
Value                Count    Frequency (%)
10                   3241     13.8%
6                    1992     8.5%
8                    1653     7.1%
9                    1538     6.6%
5                    1396     6.0%
7                    1285     5.5%
1                    1284     5.5%
4                    992      4.2%
12                   757      3.2%
3                    670      2.9%
15                   621      2.6%
16                   614      2.6%
11                   571      2.4%
13                   569      2.4%
21                   540      2.3%
17                   517      2.2%
14                   495      2.1%
2                    485      2.1%
20                   480      2.0%
18                   425      1.8%
23                   354      1.5%
19                   350      1.5%
22                   330      1.4%
24                   288      1.2%
26                   223      1.0%
Other values (15)    1758     7.5%

Value    Count    Frequency (%)
0        211      0.9%
1        1284     5.5%
2        485      2.1%
3        670      2.9%
4        992      4.2%
5        1396     6.0%
6        1992     8.5%
7        1285     5.5%
8        1653     7.1%
9        1538     6.6%

Value    Count    Frequency (%)
40       37       0.2%
38       16       0.1%
37       56       0.2%
36       92       0.4%
35       46       0.2%
34       82       0.3%
33       110      0.5%
32       150      0.6%
31       148      0.6%
30       105      0.4%
 

TrainingTimesLastYear
Real number (ℝ≥0)

ZEROS

Distinct count: 9
Unique (%): < 0.1%
Missing: 11
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.80017075773746
Minimum: 0.0
Maximum: 30.0
Zeros: 871
Zeros (%): 3.7%
Memory size: 183.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 2
Median: 3
Q3: 3
95-th percentile: 5
Maximum: 30
Range: 30
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 1.308527349
Coefficient of variation (CV): 0.4673026977
Kurtosis: 10.24969617
Mean: 2.800170758
Median Absolute Deviation (MAD): 1
Skewness: 1.042542576
Sum: 65594
Variance: 1.712243823
Histogram with fixed size bins (bins=10)
Value        Count    Frequency (%)
2            8725     37.2%
3            7806     33.3%
4            1970     8.4%
5            1882     8.0%
1            1126     4.8%
6            1043     4.5%
0            871      3.7%
22           1        < 0.1%
30           1        < 0.1%
(Missing)    11       < 0.1%

Value    Count    Frequency (%)
0        871      3.7%
1        1126     4.8%
2        8725     37.2%
3        7806     33.3%
4        1970     8.4%
5        1882     8.0%
6        1043     4.5%
22       1        < 0.1%
30       1        < 0.1%

Value    Count    Frequency (%)
30       1        < 0.1%
22       1        < 0.1%
6        1043     4.5%
5        1882     8.0%
4        1970     8.4%
3        7806     33.3%
2        8725     37.2%
1        1126     4.8%
0        871      3.7%
 

WorkLifeBalance
Categorical

Distinct count: 4
Unique (%): < 0.1%
Missing: 10
Missing (%): < 0.1%
Memory size: 183.2 KiB

Value        Count    Frequency (%)
3            14238    60.8%
2            5479     23.4%
4            2439     10.4%
1            1270     5.4%
(Missing)    10       < 0.1%
 

Length

Max length3
Median length3
Mean length3
Min length3

Overview of Unicode Properties

Unique unicode characters8
Unique unicode categories (?)3
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
.2342633.3%
 
02342633.3%
 
31423820.3%
 
254797.8%
 
424393.5%
 
112701.8%
 
n20< 0.1%
 
a10< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number4685266.6%
 
Other Punctuation2342633.3%
 
Lowercase Letter30< 0.1%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
02342650.0%
 
31423830.4%
 
2547911.7%
 
424395.2%
 
112702.7%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
.23426100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n2066.7%
 
a1033.3%
 

Most occurring scripts

ValueCountFrequency (%) 
Common70278> 99.9%
 
Latin30< 0.1%
 

Most frequent Common characters

ValueCountFrequency (%) 
.2342633.3%
 
02342633.3%
 
31423820.3%
 
254797.8%
 
424393.5%
 
112701.8%
 

Most frequent Latin characters

ValueCountFrequency (%) 
n2066.7%
 
a1033.3%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII70308100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
.2342633.3%
 
02342633.3%
 
31423820.3%
 
254797.8%
 
424393.5%
 
112701.8%
 
n20< 0.1%
 
a10< 0.1%
 

YearsAtCompany
Real number (ℝ≥0)

ZEROS

Distinct count: 37
Unique (%): 0.2%
Missing: 13
Missing (%): 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 7.010886735260215
Minimum: 0.0
Maximum: 40.0
Zeros: 740
Zeros (%): 3.2%
Memory size: 183.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 3
Median: 5
Q3: 10
95-th percentile: 20
Maximum: 40
Range: 40
Interquartile range (IQR): 7

Descriptive statistics

Standard deviation: 6.138394186
Coefficient of variation (CV): 0.8755517551
Kurtosis: 3.895293291
Mean: 7.010886735
Median Absolute Deviation (MAD): 3
Skewness: 1.758900923
Sum: 164216
Variance: 37.67988318
Histogram with fixed size bins (bins=10)
Value                Count    Frequency (%)
5                    3112     13.3%
1                    2718     11.6%
3                    2030     8.7%
2                    2008     8.6%
10                   1928     8.2%
4                    1755     7.5%
7                    1413     6.0%
9                    1295     5.5%
8                    1275     5.4%
6                    1219     5.2%
0                    740      3.2%
11                   514      2.2%
20                   437      1.9%
13                   371      1.6%
15                   311      1.3%
14                   304      1.3%
22                   238      1.0%
12                   221      0.9%
21                   220      0.9%
18                   204      0.9%
16                   198      0.8%
19                   176      0.8%
17                   143      0.6%
24                   95       0.4%
33                   85       0.4%
Other values (12)    413      1.8%

Value    Count    Frequency (%)
0        740      3.2%
1        2718     11.6%
2        2008     8.6%
3        2030     8.7%
4        1755     7.5%
5        3112     13.3%
6        1219     5.2%
7        1413     6.0%
8        1275     5.4%
9        1295     5.5%

Value    Count    Frequency (%)
40       15       0.1%
37       16       0.1%
36       32       0.1%
34       16       0.1%
33       85       0.4%
32       48       0.2%
31       48       0.2%
30       16       0.1%
29       30       0.1%
27       32       0.1%
 

YearsInCurrentRole
Real number (ℝ≥0)

ZEROS

Distinct count: 20
Unique (%): 0.1%
Missing: 15
Missing (%): 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.227445454933607
Minimum: 0.0
Maximum: 22.0
Zeros: 3925
Zeros (%): 16.7%
Memory size: 183.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 2
Median: 3
Q3: 7
95-th percentile: 11
Maximum: 22
Range: 22
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 3.627284157
Coefficient of variation (CV): 0.8580321604
Kurtosis: 0.4853770175
Mean: 4.227445455
Median Absolute Deviation (MAD): 3
Skewness: 0.9186346507
Sum: 99011
Variance: 13.15719035
Histogram with fixed size bins (bins=10)
Value        Count    Frequency (%)
2            5928     25.3%
0            3925     16.7%
7            3516     15.0%
3            2141     9.1%
4            1635     7.0%
8            1426     6.1%
9            1074     4.6%
1            887      3.8%
5            594      2.5%
6            591      2.5%
10           459      2.0%
11           359      1.5%
13           221      0.9%
14           176      0.8%
12           149      0.6%
15           134      0.6%
16           110      0.5%
17           63       0.3%
18           32       0.1%
22           1        < 0.1%
(Missing)    15       0.1%

Value    Count    Frequency (%)
0        3925     16.7%
1        887      3.8%
2        5928     25.3%
3        2141     9.1%
4        1635     7.0%
5        594      2.5%
6        591      2.5%
7        3516     15.0%
8        1426     6.1%
9        1074     4.6%

Value    Count    Frequency (%)
22       1        < 0.1%
18       32       0.1%
17       63       0.3%
16       110      0.5%
15       134      0.6%
14       176      0.8%
13       221      0.9%
12       149      0.6%
11       359      1.5%
10       459      2.0%
 

YearsSinceLastPromotion
Real number (ℝ≥0)

ZEROS

Distinct count: 17
Unique (%): 0.1%
Missing: 11
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.183820704375667
Minimum: 0.0
Maximum: 17.0
Zeros: 9271
Zeros (%): 39.6%
Memory size: 183.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 1
Q3: 3
95-th percentile: 9
Maximum: 17
Range: 17
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 3.218614665
Coefficient of variation (CV): 1.473845659
Kurtosis: 3.623118393
Mean: 2.183820704
Median Absolute Deviation (MAD): 1
Skewness: 1.986932469
Sum: 51156
Variance: 10.35948036
Histogram with fixed size bins (bins=10)
Value        Count    Frequency (%)
0            9271     39.6%
1            5684     24.3%
2            2533     10.8%
7            1215     5.2%
4            961      4.1%
3            843      3.6%
5            713      3.0%
6            508      2.2%
11           377      1.6%
8            284      1.2%
9            271      1.2%
15           206      0.9%
12           160      0.7%
13           158      0.7%
14           144      0.6%
10           96       0.4%
17           1        < 0.1%
(Missing)    11       < 0.1%

Value    Count    Frequency (%)
0        9271     39.6%
1        5684     24.3%
2        2533     10.8%
3        843      3.6%
4        961      4.1%
5        713      3.0%
6        508      2.2%
7        1215     5.2%
8        284      1.2%
9        271      1.2%

Value    Count    Frequency (%)
17       1        < 0.1%
15       206      0.9%
14       144      0.6%
13       158      0.7%
12       160      0.7%
11       377      1.6%
10       96       0.4%
9        271      1.2%
8        284      1.2%
7        1215     5.2%
 

YearsWithCurrManager
Real number (ℝ≥0)

ZEROS

Distinct count: 18
Unique (%): 0.1%
Missing: 7
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.12757693456827
Minimum: 0.0
Maximum: 17.0
Zeros: 4197
Zeros (%): 17.9%
Memory size: 183.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 2
Median: 3
Q3: 7
95-th percentile: 10
Maximum: 17
Range: 17
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 3.572379438
Coefficient of variation (CV): 0.865490697
Kurtosis: 0.1609622479
Mean: 4.127576935
Median Absolute Deviation (MAD): 3
Skewness: 0.8316199248
Sum: 96705
Variance: 12.76189485
Histogram with fixed size bins (bins=10)
Value        Count    Frequency (%)
2            5461     23.3%
0            4197     17.9%
7            3439     14.7%
3            2254     9.6%
8            1696     7.2%
4            1572     6.7%
1            1219     5.2%
9            1035     4.4%
5            496      2.1%
6            457      1.9%
10           435      1.9%
11           351      1.5%
12           277      1.2%
13           238      1.0%
17           112      0.5%
14           79       0.3%
15           79       0.3%
16           32       0.1%
(Missing)    7        < 0.1%

Value    Count    Frequency (%)
0        4197     17.9%
1        1219     5.2%
2        5461     23.3%
3        2254     9.6%
4        1572     6.7%
5        496      2.1%
6        457      1.9%
7        3439     14.7%
8        1696     7.2%
9        1035     4.4%

Value    Count    Frequency (%)
17       112      0.5%
16       32       0.1%
15       79       0.3%
14       79       0.3%
13       238      1.0%
12       277      1.2%
11       351      1.5%
10       435      1.9%
9        1035     4.4%
8        1696     7.2%
816967.2%
 

Employee Source
Categorical

HIGH CORRELATION

Distinct count: 12
Unique (%): 0.1%
Missing: 12
Missing (%): 0.1%
Memory size: 183.2 KiB

Value              Count    Frequency (%)
Company Website    5400     23.0%
Seek               3689     15.7%
Indeed             2529     10.8%
Jora               2422     10.3%
LinkedIn           2339     10.0%
Recruit.net        2322     9.9%
GlassDoor          2176     9.3%
Adzuna             2126     9.1%
Referral           418      1.8%
Test               1        < 0.1%
2                  1        < 0.1%
15                 1        < 0.1%
(Missing)          12       0.1%
 

Length

Max length15
Median length8
Mean length8.559438471
Min length1

Overview of Unicode Properties

Unique unicode characters35
Unique unicode categories (?)5
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e3105615.5%
 
n170798.5%
 
a125546.3%
 
o121746.1%
 
i100615.0%
 
t100455.0%
 
s97534.9%
 
d95234.7%
 
r77563.9%
 
k60283.0%
 
C54002.7%
 
m54002.7%
 
p54002.7%
 
y54002.7%
 
54002.7%
 
W54002.7%
 
b54002.7%
 
I48682.4%
 
u44482.2%
 
S36891.8%
 
R27401.4%
 
l25941.3%
 
J24221.2%
 
L23391.2%
 
c23221.2%
 
Other values (10)113485.7%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter15953779.5%
 
Uppercase Letter3333716.6%
 
Space Separator54002.7%
 
Other Punctuation23221.2%
 
Decimal Number3< 0.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
C540016.2%
 
W540016.2%
 
I486814.6%
 
S368911.1%
 
R27408.2%
 
J24227.3%
 
L23397.0%
 
G21766.5%
 
D21766.5%
 
A21266.4%
 
T1< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e3105619.5%
 
n1707910.7%
 
a125547.9%
 
o121747.6%
 
i100616.3%
 
t100456.3%
 
s97536.1%
 
d95236.0%
 
r77564.9%
 
k60283.8%
 
m54003.4%
 
p54003.4%
 
y54003.4%
 
b54003.4%
 
u44482.8%
 
l25941.6%
 
c23221.5%
 
z21261.3%
 
f4180.3%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
5400100.0%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
.2322100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
1133.3%
 
5133.3%
 
2133.3%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin19287496.1%
 
Common77253.9%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e3105616.1%
 
n170798.9%
 
a125546.5%
 
o121746.3%
 
i100615.2%
 
t100455.2%
 
s97535.1%
 
d95234.9%
 
r77564.0%
 
k60283.1%
 
C54002.8%
 
m54002.8%
 
p54002.8%
 
y54002.8%
 
W54002.8%
 
b54002.8%
 
I48682.5%
 
u44482.3%
 
S36891.9%
 
R27401.4%
 
l25941.3%
 
J24221.3%
 
L23391.2%
 
c23221.2%
 
G21761.1%
 
Other values (5)68473.5%
 

Most frequent Common characters

ValueCountFrequency (%) 
540069.9%
 
.232230.1%
 
11< 0.1%
 
51< 0.1%
 
21< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII200599100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e3105615.5%
 
n170798.5%
 
a125546.3%
 
o121746.1%
 
i100615.0%
 
t100455.0%
 
s97534.9%
 
d95234.7%
 
r77563.9%
 
k60283.0%
 
C54002.7%
 
m54002.7%
 
p54002.7%
 
y54002.7%
 
54002.7%
 
W54002.7%
 
b54002.7%
 
I48682.4%
 
u44482.2%
 
S36891.8%
 
R27401.4%
 
l25941.3%
 
J24221.2%
 
L23391.2%
 
c23221.2%
 
Other values (10)113485.7%
 

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, which implies that for a linear relationship the steepness of the line does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
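
The calculation described above can be sketched in a few lines of standard-library Python. This is an illustrative sketch rather than the report's own implementation; the function name `pearson_r` and the sample numbers are assumptions made for the example.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    # A constant input has zero standard deviation, so r is undefined there
    # and this division raises ZeroDivisionError.
    return cov / (std_x * std_y)


# Hypothetical data: years of experience and a monthly income figure.
years = [1, 3, 5, 10, 20]
income = [2000, 2500, 4000, 6000, 9000]
r = pearson_r(years, income)

# Shifting and positively rescaling either variable leaves r unchanged.
r_shifted = pearson_r([3 * v + 7 for v in years], [0.5 * v - 100 for v in income])
print(r, r_shifted)
```

The same 1/n factor appears in the covariance and in both standard deviations, so it cancels; using n - 1 throughout would give the same r.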

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similar to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
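
The pair-counting definition above corresponds to the variant usually called tau-a. The sketch below implements exactly that definition with a direct O(n²) loop; tied pairs are counted as neither concordant nor discordant, which is an assumption of this example, and library implementations often use a tie-adjusted variant instead.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            concordant += 1   # both variables move in the same direction
        elif product < 0:
            discordant += 1   # the variables move in opposite directions
        # product == 0 means a tie in x or y; the pair counts for neither side
    total_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / total_pairs


# Four observations give six pairs; only the middle two are out of order,
# so there are 5 concordant pairs and 1 discordant pair.
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
print(tau)
```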

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
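
For readers who want to see the shape of the bias correction, here is a sketch of the bias-corrected estimator as it is commonly written: φ² = χ²/n is reduced by (k-1)(r-1)/(n-1) and clipped at zero, and the table dimensions r and k are shrunk in the same spirit. The function name and the list-of-lists table layout are assumptions for this example, and the report's own code may differ in detail.

```python
import math


def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as rows of counts."""
    n = sum(sum(row) for row in table)
    if n < 2:
        raise ValueError("need at least two observations")
    r = len(table)
    k = len(table[0])
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(r)) for j in range(k)]

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (table[i][j] - expected) ** 2 / expected

    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(k_corr - 1, r_corr - 1)
    if denom <= 0:
        return 0.0  # degenerate table (a single row or column)
    return math.sqrt(phi2_corr / denom)


# Hypothetical 2x2 tables: no association, then perfect association.
print(cramers_v_corrected([[10, 10], [10, 10]]))
print(cramers_v_corrected([[50, 0], [0, 50]]))
```

Clipping φ² at zero is what keeps the corrected value from going negative when the observed association is weaker than the expected sampling noise.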

Missing values

Sample

First rows

AgeAttritionBusinessTravelDailyRateDepartmentDistanceFromHomeEducationEducationFieldEmployeeCountEmployeeNumberApplication IDEnvironmentSatisfactionGenderHourlyRateJobInvolvementJobLevelJobRoleJobSatisfactionMaritalStatusMonthlyIncomeMonthlyRateNumCompaniesWorkedOver18OverTimePercentSalaryHikePerformanceRatingRelationshipSatisfactionStandardHoursStockOptionLevelTotalWorkingYearsTrainingTimesLastYearWorkLifeBalanceYearsAtCompanyYearsInCurrentRoleYearsSinceLastPromotionYearsWithCurrManagerEmployee Source
041.0Voluntary ResignationTravel_Rarely1102.0Sales12.0Life Sciences111234562.0Female943.02.0Sales Executive4Single599319479.08.0YYes113.01.080.00.08.00.01.06.04.00.05.0Referral
141.0Voluntary ResignationTravel_Rarely1102.0Sales12.0Life Sciences111234582.0Female943.02.0Sales Executive4Single599319479.04.0YYes113.01.080.00.08.00.01.06.04.00.05.0Referral
241.0Voluntary ResignationTravel_Rarely1102.0Sales12.0Life Sciences171234622.0Female943.02.0Sales Executive4Single599319479.08.0YYes113.01.080.00.08.00.01.06.04.00.05.0Referral
341.0Voluntary ResignationTravel_Rarely1102.0Sales12.0Life Sciences181234632.0Female943.02.0Sales Executive4Single599319479.04.0YYes113.01.080.00.08.00.01.06.04.00.05.0Referral
441.0Voluntary ResignationTravel_Rarely1102.0Sales12.0Life Sciences191234642.0Female943.02.0Sales Executive4Single599319479.08.0YYes113.01.080.00.08.00.01.06.04.00.05.0Referral
541.0Voluntary ResignationTravel_Rarely1102.0Sales12.0Life Sciences1101234654.0Female333.04.0Manager3Divorced1475619730.02.0YYes143.03.080.03.021.02.03.05.00.00.02.0Company Website
641.0Voluntary ResignationTravel_Rarely1102.0Sales12.0Life Sciences1111234661.0Female413.05.0Manager1Married195663854.05.0YNo113.04.080.00.033.05.01.029.08.011.010.0Indeed
741.0Voluntary ResignationTravel_Rarely1102.0Sales12.0Life Sciences1131234682.0Female943.02.0Sales Executive4Single599319479.04.0YYes113.01.080.00.08.00.01.06.04.00.05.0Referral
841.0Voluntary ResignationTravel_Rarely1102.0Sales12.0Life Sciences1171234722.0Female943.02.0Sales Executive4Single599319479.08.0YYes113.01.080.00.08.00.01.06.04.00.05.0Referral
941.0Voluntary ResignationTravel_Rarely1102.0Sales12.0Life Sciences1181234732.0Female943.02.0Sales Executive4Single599319479.04.0YYes113.01.080.00.08.00.01.06.04.00.05.0Referral

Last rows

Age | Attrition | BusinessTravel | DailyRate | Department | DistanceFromHome | Education | EducationField | EmployeeCount | EmployeeNumber | Application ID | EnvironmentSatisfaction | Gender | HourlyRate | JobInvolvement | JobLevel | JobRole | JobSatisfaction | MaritalStatus | MonthlyIncome | MonthlyRate | NumCompaniesWorked | Over18 | OverTime | PercentSalaryHike | PerformanceRating | RelationshipSatisfaction | StandardHours | StockOptionLevel | TotalWorkingYears | TrainingTimesLastYear | WorkLifeBalance | YearsAtCompany | YearsInCurrentRole | YearsSinceLastPromotion | YearsWithCurrManager | Employee Source
2342660.0Current employeeTravel_Rarely370.0Research & Development14.0Life Sciences1193321427873.0Male921.03.0Healthcare Representative4Divorced1088320467.00.0YNo203.03.080.01.019.02.04.01.00.00.00.0Company Website
2342760.0Current employeeTravel_Rarely370.0Research & Development14.0Medical1193361427913.0Male921.03.0Healthcare Representative4Single1088320467.03.0YNo204.03.080.01.020.02.03.020.07.02.013.0Company Website
2342860.0Current employeeTravel_Rarely370.0Research & Development14.0Life Sciences1193371427923.0Male921.03.0Healthcare Representative4Divorced1088320467.00.0YNo204.03.080.01.019.02.04.01.00.00.00.0Company Website
2342960.0Current employeeTravel_Rarely370.0Research & Development14.0Medical1193381427933.0Male921.03.0Healthcare Representative4Divorced1088320467.03.0YNo203.03.080.01.019.02.04.01.00.00.00.0Company Website
2343060.0Current employeeTravel_Rarely370.0Research & Development14.0Life Sciences1193401427953.0Male921.03.0Healthcare Representative4Divorced1088320467.00.0YNo203.03.080.01.019.02.04.01.00.00.00.0Company Website
2343160.0Current employeeTravel_Rarely370.0Research & Development14.0Medical1193441427993.0Male921.03.0Healthcare Representative4Single1088320467.03.0YNo204.03.080.01.020.02.03.020.07.02.013.0Company Website
2343260.0Current employeeTravel_Rarely370.0Research & Development14.0Life Sciences1193451428003.0Male921.03.0Healthcare Representative4Divorced1088320467.00.0YNo204.03.080.01.019.02.04.01.00.00.00.0Company Website
23433NaNVoluntary ResignationTravel_Frequently1009.0Research & Development13.0Life Sciences1167941402494.0Male833.02.0Sales Executive3Married53012939.04.0YNo153.03.080.02.04.02.02.02.01.02.02.0Adzuna
23434NaNCurrent employeeTravel_Rarely1354.0Research & Development53.0Medical119561254113.0Female452.03.0Manager1Single116315615.02.0YNo123.04.080.00.014.06.03.011.010.05.08.0Indeed
23435NaNCurrent employeeNon-Travel1142.0Research & Development82.0Life Sciences1175871410424.0Male723.02.0Healthcare Representative4Divorced40698841.03.0YYes183.03.080.00.08.02.03.02.02.02.02.0Recruit.net

Duplicate rows

Most frequent

Age | Attrition | BusinessTravel | DailyRate | Department | Education | EducationField | EmployeeNumber | EnvironmentSatisfaction | Gender | HourlyRate | JobInvolvement | JobLevel | JobRole | JobSatisfaction | MaritalStatus | MonthlyIncome | MonthlyRate | NumCompaniesWorked | Over18 | OverTime | PercentSalaryHike | PerformanceRating | RelationshipSatisfaction | StandardHours | StockOptionLevel | TotalWorkingYears | TrainingTimesLastYear | WorkLifeBalance | YearsAtCompany | YearsInCurrentRole | YearsSinceLastPromotion | YearsWithCurrManager | Employee Source | count
021.0Current employeeTravel_Rarely391.0Research & Development2.0Life Sciences8161.0Male391.01.0Laboratory Technician3Single229310558.01.0YNo124.03.080.00.01.02.02.01.00.00.01.0Seek2
121.0Current employeeTravel_Rarely391.0Research & Development2.0Life Sciences8193.0Male391.01.0Laboratory Technician3Single22937324.01.0YNo123.03.080.00.01.02.02.01.00.00.01.0Seek2
221.0Current employeeTravel_Rarely391.0Research & Development2.0Medical8171.0Male391.01.0Laboratory Technician3Single229310558.01.0YNo123.03.080.00.01.02.02.01.00.00.01.0Seek2
326.0Voluntary ResignationTravel_Rarely1357.0Research & Development3.0Life Sciences8131.0Male481.01.0Laboratory Technician3Single229321534.01.0YNo123.03.080.00.01.02.02.01.00.00.01.0Seek2
426.0Voluntary ResignationTravel_Rarely1357.0Research & Development3.0Life Sciences8151.0Male481.01.0Laboratory Technician3Single229310558.01.0YNo123.03.080.00.01.02.02.01.00.00.01.0Seek2
526.0Voluntary ResignationTravel_Rarely1357.0Research & Development3.0Technical Degree8141.0Male481.01.0Laboratory Technician3Single229326009.01.0YNo123.03.080.00.01.02.02.01.00.00.01.0Seek2
626.0Voluntary ResignationTravel_Rarely1357.0Research & Development3.0Technical Degree8181.0Male481.01.0Laboratory Technician3Single229310558.01.0YNo123.03.080.00.01.02.02.01.00.00.01.0Seek2
732.0Current employeeNon-Travel953.0Research & Development4.0Life Sciences232444.0Female1002.02.0Sales Executive4Married665214369.05.0YNo133.01.080.01.08.02.02.06.03.00.00.0Seek2
837.0Voluntary ResignationTravel_Rarely807.0Human Resources4.0Human Resources11.0Female373.02.0Sales Executive4Single599319479.08.0YYes114.01.080.00.08.00.01.06.04.00.05.0Referral2
937.0Voluntary ResignationTravel_Rarely807.0Human Resources4.0Human Resources51.0Female373.02.0Sales Executive4Single599319479.08.0YYes113.01.080.00.08.00.01.06.04.00.05.0Referral2